Lexical Semantic Ambiguity Resolution with Bigram-Based Decision Trees
نویسنده
چکیده
This paper presents a corpus-based approach to word sense disambiguation where a decision tree assigns a sense to an ambiguous word based on the bigrams that occur nearby. This approach is evaluated using the sense-tagged corpora from the 1998 SENSEVAL word sense disambiguation exercise. It is more accurate than the average results reported for 30 of 36 words, and is more accurate than the best results for 19 of 36 words.
منابع مشابه
Evaluating the Effectiveness of Ensembles of Decision Trees in Disambiguating Senseval Lexical Samples
This paper presents an evaluation of an ensemble–based system that participated in the English and Spanish lexical sample tasks of SENSEVAL-2. The system combines decision trees of unigrams, bigrams, and co–occurrences into a single classifier. The analysis is extended to include the SENSEVAL-1 data.
متن کاملSemantic Priming Effect on Relative Clause Attachment Ambiguity Resolution in L2
This study examined whether processing ambiguous sentences containing relative clauses (RCs) following a complex determiner phrase (DP) by Persian-speaking learners of L2 English with different proficiency and working memory capacities (WMCs) is affected by semantic priming. The semantic relationship studied was one between the subject/verb of the main clause and one of the DPs in the complex D...
متن کاملAmbiguity and synonymy effects in lexical decision, naming, and semantic categorization tasks: interactions between orthography, phonology, and semantics.
In this article, ambiguity and synonymy effects were examined in lexical decision, naming, and semantic categorization tasks. Whereas the typical ambiguity advantage was observed in lexical decision and naming, an ambiguity disadvantage was observed in semantic categorization. In addition, a synonymy effect (slower latencies for words with many synonyms than for words with few synonyms) was obs...
متن کاملEvaluating the Effectiveness of Ensembles of Decision Trees
This paper presents an evaluation of an ensemble–based system that participated in the English and Spanish lexical sample tasks of SENSEVAL-2. The system combines decision trees of unigrams, bigrams, and co–occurrences into a single classifier. The analysis is extended to include the SENSEVAL-1 data.
متن کاملVoice Assimilation Phenomenon and Its Implementation in LVCSR System with Lexical Tree and Bigram Language Model
In this paper a LVCSR system with implementation of the Czech voice assimilation phenomenon is proposed. The recognition system uses lexical trees and a bigram language model. The first part of this article is focused on voice assimilation phenomenon description, triphone lexical tree construction, and voice assimilation impact on LVCSR system performance. The second part outlines lexical tree ...
متن کامل